Overview

Dataset statistics

Number of variables: 23
Number of observations: 129880
Missing cells: 393
Missing cells (%): < 0.1%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 22.8 MiB
Average record size in memory: 184.0 B
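
The derived figures in this table follow from the raw counts. A minimal sketch of the arithmetic, assuming the percentage is taken over all cells (variables × observations) and that 1 MiB = 1024² bytes; the variable names are illustrative and not part of the report:

```python
# Figures copied from the "Dataset statistics" table above.
n_variables = 23
n_observations = 129_880
missing_cells = 393
total_size_mib = 22.8  # already rounded in the report

# Assumption: "Missing cells (%)" is computed over every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: MiB means 1024 * 1024 bytes.
avg_record_bytes = total_size_mib * 1024 ** 2 / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.4f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```

Because the total size is itself rounded to one decimal place, the recomputed record size only approximately reproduces the reported 184.0 B.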

Variable types

Categorical: 6
Numeric: 17

Alerts

Seat comfort is highly correlated with Food and drink (High correlation)
Departure/Arrival time convenient is highly correlated with Food and drink and 1 other field (High correlation)
Food and drink is highly correlated with Seat comfort and 2 other fields (High correlation)
Gate location is highly correlated with Departure/Arrival time convenient and 1 other field (High correlation)
Inflight wifi service is highly correlated with Online support and 2 other fields (High correlation)
Online support is highly correlated with Inflight wifi service and 2 other fields (High correlation)
Ease of Online booking is highly correlated with Inflight wifi service and 2 other fields (High correlation)
On-board service is highly correlated with Baggage handling and 1 other field (High correlation)
Baggage handling is highly correlated with On-board service and 1 other field (High correlation)
Cleanliness is highly correlated with On-board service and 1 other field (High correlation)
Online boarding is highly correlated with Inflight wifi service and 2 other fields (High correlation)
Departure Delay in Minutes is highly correlated with Arrival Delay in Minutes (High correlation)
Arrival Delay in Minutes is highly correlated with Departure Delay in Minutes (High correlation)
Seat comfort is highly correlated with Food and drink (High correlation)
Departure/Arrival time convenient is highly correlated with Gate location (High correlation)
Food and drink is highly correlated with Seat comfort (High correlation)
Gate location is highly correlated with Departure/Arrival time convenient (High correlation)
Inflight wifi service is highly correlated with Ease of Online booking and 1 other field (High correlation)
Online support is highly correlated with Ease of Online booking and 1 other field (High correlation)
Ease of Online booking is highly correlated with Inflight wifi service and 2 other fields (High correlation)
On-board service is highly correlated with Cleanliness (High correlation)
Baggage handling is highly correlated with Cleanliness (High correlation)
Cleanliness is highly correlated with On-board service and 1 other field (High correlation)
Online boarding is highly correlated with Inflight wifi service and 2 other fields (High correlation)
Departure Delay in Minutes is highly correlated with Arrival Delay in Minutes (High correlation)
Arrival Delay in Minutes is highly correlated with Departure Delay in Minutes (High correlation)
Class is highly correlated with Type of Travel (High correlation)
Type of Travel is highly correlated with Class (High correlation)
satisfaction is highly correlated with Seat comfort and 4 other fields (High correlation)
Seat comfort is highly correlated with satisfaction and 4 other fields (High correlation)
Departure/Arrival time convenient is highly correlated with Seat comfort and 2 other fields (High correlation)
Food and drink is highly correlated with Seat comfort and 3 other fields (High correlation)
Gate location is highly correlated with Seat comfort and 2 other fields (High correlation)
Inflight wifi service is highly correlated with Online support and 2 other fields (High correlation)
Inflight entertainment is highly correlated with satisfaction and 4 other fields (High correlation)
Online support is highly correlated with satisfaction and 5 other fields (High correlation)
Ease of Online booking is highly correlated with satisfaction and 7 other fields (High correlation)
On-board service is highly correlated with satisfaction and 4 other fields (High correlation)
Leg room service is highly correlated with Ease of Online booking and 2 other fields (High correlation)
Baggage handling is highly correlated with Ease of Online booking and 2 other fields (High correlation)
Checkin service is highly correlated with Online support (High correlation)
Cleanliness is highly correlated with Ease of Online booking and 3 other fields (High correlation)
Online boarding is highly correlated with Inflight wifi service and 3 other fields (High correlation)
Departure Delay in Minutes is highly correlated with Arrival Delay in Minutes (High correlation)
Arrival Delay in Minutes is highly correlated with Departure Delay in Minutes (High correlation)
Seat comfort has 4797 (3.7%) zeros (Zeros)
Departure/Arrival time convenient has 6664 (5.1%) zeros (Zeros)
Food and drink has 5945 (4.6%) zeros (Zeros)
Inflight entertainment has 2978 (2.3%) zeros (Zeros)
Departure Delay in Minutes has 73356 (56.5%) zeros (Zeros)
Arrival Delay in Minutes has 72753 (56.0%) zeros (Zeros)

Reproduction

Analysis started: 2021-11-25 15:02:42.622227
Analysis finished: 2021-11-25 15:04:06.061807
Duration: 1 minute and 23.44 seconds
Software version: pandas-profiling v3.1.0
Download configuration: config.json

Variables

satisfaction
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size1014.8 KiB
satisfied
71087 
dissatisfied
58793 

Length

Max length12
Median length9
Mean length10.35801509
Min length9

Characters and Unicode

Total characters0
Distinct characters0
Distinct categories0 ?
Distinct scripts0 ?
Distinct blocks0 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowsatisfied
2nd rowsatisfied
3rd rowsatisfied
4th rowsatisfied
5th rowsatisfied

Common Values

ValueCountFrequency (%)
satisfied71087
54.7%
dissatisfied58793
45.3%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
satisfied71087
54.7%
dissatisfied58793
45.3%

Gender
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size1014.8 KiB
Female
65899 
Male
63981 

Length

Max length6
Median length6
Mean length5.014767478
Min length4

Characters and Unicode

Total characters0
Distinct characters0
Distinct categories0 ?
Distinct scripts0 ?
Distinct blocks0 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowFemale
2nd rowMale
3rd rowFemale
4th rowFemale
5th rowFemale

Common Values

ValueCountFrequency (%)
Female65899
50.7%
Male63981
49.3%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
female65899
50.7%
male63981
49.3%

Customer Type
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size1014.8 KiB
Loyal Customer
106100 
disloyal Customer
23780 

Length

Max length17
Median length14
Mean length14.54927626
Min length14

Characters and Unicode

Total characters0
Distinct characters0
Distinct categories0 ?
Distinct scripts0 ?
Distinct blocks0 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowLoyal Customer
2nd rowLoyal Customer
3rd rowLoyal Customer
4th rowLoyal Customer
5th rowLoyal Customer

Common Values

ValueCountFrequency (%)
Loyal Customer106100
81.7%
disloyal Customer23780
 
18.3%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
customer129880
50.0%
loyal106100
40.8%
disloyal23780
 
9.2%

Age
Real number (ℝ≥0)

Distinct75
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean39.42795658
Minimum7
Maximum85
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1014.8 KiB

Quantile statistics

Minimum7
5-th percentile15
Q127
median40
Q351
95-th percentile64
Maximum85
Range78
Interquartile range (IQR)24

Descriptive statistics

Standard deviation15.11935995
Coefficient of variation (CV)0.3834680076
Kurtosis-0.7191402272
Mean39.42795658
Median Absolute Deviation (MAD)12
Skewness-0.003606211745
Sum5120903
Variance228.5950453
MonotonicityNot monotonic
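
Several of the Age statistics above are simple functions of the others. A small sketch that re-derives them from the reported values (the variable names are illustrative; the inputs are copied from the report, so the results match only up to the rounding of those inputs):

```python
# Values copied from the Age summary above.
minimum, maximum = 7, 85
q1, q3 = 27, 51
mean = 39.42795658
std = 15.11935995
n_observations = 129_880  # Age has no missing values

value_range = maximum - minimum  # Range
iqr = q3 - q1                    # Interquartile range (IQR)
cv = std / mean                  # Coefficient of variation (CV)
variance = std ** 2              # Variance
total = mean * n_observations    # Sum

print(value_range, iqr)
print(round(cv, 6), round(variance, 4), round(total))
```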
Histogram with fixed size bins (bins=50)
Value  Count  Frequency (%)
39  3692  2.8%
25  3511  2.7%
40  3209  2.5%
44  3104  2.4%
41  3089  2.4%
42  3017  2.3%
43  2941  2.3%
45  2939  2.3%
23  2935  2.3%
22  2931  2.3%
Other values (65)  98512  75.8%

Value  Count  Frequency (%)
7  685  0.5%
8  797  0.6%
9  859  0.7%
10  822  0.6%
11  837  0.6%
12  794  0.6%
13  806  0.6%
14  860  0.7%
15  1006  0.8%
16  1156  0.9%

Value  Count  Frequency (%)
85  25  < 0.1%
80  110  0.1%
79  52  < 0.1%
78  44  < 0.1%
77  106  0.1%
76  60  < 0.1%
75  76  0.1%
74  61  < 0.1%
73  67  0.1%
72  249  0.2%

Type of Travel
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size1014.8 KiB
Business travel
89693 
Personal Travel
40187 

Length

Max length15
Median length15
Mean length15
Min length15

Characters and Unicode

Total characters0
Distinct characters0
Distinct categories0 ?
Distinct scripts0 ?
Distinct blocks0 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowPersonal Travel
2nd rowPersonal Travel
3rd rowPersonal Travel
4th rowPersonal Travel
5th rowPersonal Travel

Common Values

ValueCountFrequency (%)
Business travel89693
69.1%
Personal Travel40187
30.9%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
travel129880
50.0%
business89693
34.5%
personal40187
 
15.5%

Class
Categorical

HIGH CORRELATION

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size1014.8 KiB
Business
62160 
Eco
58309 
Eco Plus
9411 

Length

Max length8
Median length8
Mean length5.755274099
Min length3

Characters and Unicode

Total characters0
Distinct characters0
Distinct categories0 ?
Distinct scripts0 ?
Distinct blocks0 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowEco
2nd rowBusiness
3rd rowEco
4th rowEco
5th rowEco

Common Values

ValueCountFrequency (%)
Business62160
47.9%
Eco58309
44.9%
Eco Plus9411
 
7.2%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
eco67720
48.6%
business62160
44.6%
plus9411
 
6.8%

Flight Distance
Real number (ℝ≥0)

Distinct5398
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1981.409055
Minimum50
Maximum6951
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1014.8 KiB

Quantile statistics

Minimum50
5-th percentile341
Q11359
median1925
Q32544
95-th percentile3831
Maximum6951
Range6901
Interquartile range (IQR)1185

Descriptive statistics

Standard deviation1027.115606
Coefficient of variation (CV)0.5183763561
Kurtosis0.3643059944
Mean1981.409055
Median Absolute Deviation (MAD)594
Skewness0.4667475219
Sum257345408
Variance1054966.467
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
Value  Count  Frequency (%)
1963  92  0.1%
1812  88  0.1%
1639  87  0.1%
1981  86  0.1%
1789  86  0.1%
1766  83  0.1%
1759  83  0.1%
1748  82  0.1%
2022  81  0.1%
1769  81  0.1%
Other values (5388)  129031  99.3%

Value  Count  Frequency (%)
50  23  < 0.1%
51  21  < 0.1%
52  21  < 0.1%
53  28  < 0.1%
54  21  < 0.1%
55  22  < 0.1%
56  30  < 0.1%
57  21  < 0.1%
58  15  < 0.1%
59  24  < 0.1%

Value  Count  Frequency (%)
6951  1  < 0.1%
6950  1  < 0.1%
6948  1  < 0.1%
6924  1  < 0.1%
6907  2  < 0.1%
6889  1  < 0.1%
6882  1  < 0.1%
6868  1  < 0.1%
6865  1  < 0.1%
6837  1  < 0.1%

Seat comfort
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
ZEROS

Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.838597167
Minimum0
Maximum5
Zeros4797
Zeros (%)3.7%
Negative0
Negative (%)0.0%
Memory size1014.8 KiB

Quantile statistics

Minimum0
5-th percentile1
Q12
median3
Q34
95-th percentile5
Maximum5
Range5
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.392983243
Coefficient of variation (CV)0.490729456
Kurtosis-0.9431930858
Mean2.838597167
Median Absolute Deviation (MAD)1
Skewness-0.09186099833
Sum368677
Variance1.940402316
MonotonicityNot monotonic
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
329183
22.5%
228726
22.1%
428398
21.9%
120949
16.1%
517827
13.7%
04797
 
3.7%
ValueCountFrequency (%)
04797
 
3.7%
120949
16.1%
228726
22.1%
329183
22.5%
428398
21.9%
517827
13.7%
ValueCountFrequency (%)
517827
13.7%
428398
21.9%
329183
22.5%
228726
22.1%
120949
16.1%
04797
 
3.7%

Departure/Arrival time convenient
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
ZEROS

Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.990645211
Minimum0
Maximum5
Zeros6664
Zeros (%)5.1%
Negative0
Negative (%)0.0%
Memory size1014.8 KiB

Quantile statistics

Minimum0
5-th percentile0
Q12
median3
Q34
95-th percentile5
Maximum5
Range5
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.52722437
Coefficient of variation (CV)0.5106671847
Kurtosis-1.089371035
Mean2.990645211
Median Absolute Deviation (MAD)1
Skewness-0.2522824496
Sum388425
Variance2.332414277
MonotonicityNot monotonic
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
429593
22.8%
526817
20.6%
323184
17.9%
222794
17.6%
120828
16.0%
06664
 
5.1%
ValueCountFrequency (%)
06664
 
5.1%
120828
16.0%
222794
17.6%
323184
17.9%
429593
22.8%
526817
20.6%
ValueCountFrequency (%)
526817
20.6%
429593
22.8%
323184
17.9%
222794
17.6%
120828
16.0%
06664
 
5.1%

Food and drink
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
ZEROS

Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.851994148
Minimum0
Maximum5
Zeros5945
Zeros (%)4.6%
Negative0
Negative (%)0.0%
Memory size1014.8 KiB

Quantile statistics

Minimum0
5-th percentile1
Q12
median3
Q34
95-th percentile5
Maximum5
Range5
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.443729387
Coefficient of variation (CV)0.5062175136
Kurtosis-0.9867275423
Mean2.851994148
Median Absolute Deviation (MAD)1
Skewness-0.1168129521
Sum370417
Variance2.084354542
MonotonicityNot monotonic
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
328150
21.7%
427216
21.0%
227146
20.9%
121076
16.2%
520347
15.7%
05945
 
4.6%
ValueCountFrequency (%)
05945
 
4.6%
121076
16.2%
227146
20.9%
328150
21.7%
427216
21.0%
520347
15.7%
ValueCountFrequency (%)
520347
15.7%
427216
21.0%
328150
21.7%
227146
20.9%
121076
16.2%
05945
 
4.6%

Gate location
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.990421928
Minimum0
Maximum5
Zeros2
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size1014.8 KiB

Quantile statistics

Minimum0
5-th percentile1
Q12
median3
Q34
95-th percentile5
Maximum5
Range5
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.305969894
Coefficient of variation (CV)0.4367176022
Kurtosis-1.089822453
Mean2.990421928
Median Absolute Deviation (MAD)1
Skewness-0.0530638946
Sum388396
Variance1.705557364
MonotonicityNot monotonic
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
333546
25.8%
430088
23.2%
224518
18.9%
122565
17.4%
519161
14.8%
02
 
< 0.1%
ValueCountFrequency (%)
02
 
< 0.1%
122565
17.4%
224518
18.9%
333546
25.8%
430088
23.2%
519161
14.8%
ValueCountFrequency (%)
519161
14.8%
430088
23.2%
333546
25.8%
224518
18.9%
122565
17.4%
02
 
< 0.1%

Inflight wifi service
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.249129966
Minimum0
Maximum5
Zeros132
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size1014.8 KiB

Quantile statistics

Minimum0
5-th percentile1
Q12
median3
Q34
95-th percentile5
Maximum5
Range5
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.31881752
Coefficient of variation (CV)0.4058986662
Kurtosis-1.12144606
Mean3.249129966
Median Absolute Deviation (MAD)1
Skewness-0.1911228457
Sum421997
Variance1.73927965
MonotonicityNot monotonic
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
431560
24.3%
528830
22.2%
327602
21.3%
227045
20.8%
114711
11.3%
0132
 
0.1%
ValueCountFrequency (%)
0132
 
0.1%
114711
11.3%
227045
20.8%
327602
21.3%
431560
24.3%
528830
22.2%
ValueCountFrequency (%)
528830
22.2%
431560
24.3%
327602
21.3%
227045
20.8%
114711
11.3%
0132
 
0.1%

Inflight entertainment
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.383477056
Minimum0
Maximum5
Zeros2978
Zeros (%)2.3%
Negative0
Negative (%)0.0%
Memory size1014.8 KiB

Quantile statistics

Minimum0
5-th percentile1
Q12
median4
Q34
95-th percentile5
Maximum5
Range5
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.346059144
Coefficient of variation (CV)0.3978330937
Kurtosis-0.5327859187
Mean3.383477056
Median Absolute Deviation (MAD)1
Skewness-0.6048282202
Sum439446
Variance1.81187522
MonotonicityNot monotonic
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
441879
32.2%
529831
23.0%
324200
18.6%
219183
14.8%
111809
 
9.1%
02978
 
2.3%
ValueCountFrequency (%)
02978
 
2.3%
111809
 
9.1%
219183
14.8%
324200
18.6%
441879
32.2%
529831
23.0%
ValueCountFrequency (%)
529831
23.0%
441879
32.2%
324200
18.6%
219183
14.8%
111809
 
9.1%
02978
 
2.3%

Online support
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.519702803
Minimum0
Maximum5
Zeros1
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size1014.8 KiB

Quantile statistics

Minimum0
5-th percentile1
Q13
median4
Q35
95-th percentile5
Maximum5
Range5
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.30651069
Coefficient of variation (CV)0.3711991505
Kurtosis-0.8105718251
Mean3.519702803
Median Absolute Deviation (MAD)1
Skewness-0.57536498
Sum457139
Variance1.706970184
MonotonicityNot monotonic
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
441510
32.0%
535563
27.4%
321609
16.6%
217260
13.3%
113937
 
10.7%
01
 
< 0.1%
ValueCountFrequency (%)
01
 
< 0.1%
113937
 
10.7%
217260
13.3%
321609
16.6%
441510
32.0%
535563
27.4%
ValueCountFrequency (%)
535563
27.4%
441510
32.0%
321609
16.6%
217260
13.3%
113937
 
10.7%
01
 
< 0.1%

Ease of Online booking
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.47210502
Minimum0
Maximum5
Zeros18
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size1014.8 KiB

Quantile statistics

Minimum0
5-th percentile1
Q12
median4
Q35
95-th percentile5
Maximum5
Range5
Interquartile range (IQR)3

Descriptive statistics

Standard deviation1.305559648
Coefficient of variation (CV)0.3760138707
Kurtosis-0.9106542561
Mean3.47210502
Median Absolute Deviation (MAD)1
Skewness-0.4917196477
Sum450957
Variance1.704485995
MonotonicityNot monotonic
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
439920
30.7%
534137
26.3%
322418
17.3%
219951
15.4%
113436
 
10.3%
018
 
< 0.1%
ValueCountFrequency (%)
018
 
< 0.1%
113436
 
10.3%
219951
15.4%
322418
17.3%
439920
30.7%
534137
26.3%
ValueCountFrequency (%)
534137
26.3%
439920
30.7%
322418
17.3%
219951
15.4%
113436
 
10.3%
018
 
< 0.1%

On-board service
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.465075454
Minimum0
Maximum5
Zeros5
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size1014.8 KiB

Quantile statistics

Minimum0
5-th percentile1
Q13
median4
Q34
95-th percentile5
Maximum5
Range5
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.270835582
Coefficient of variation (CV)0.3667555293
Kurtosis-0.7850230753
Mean3.465075454
Median Absolute Deviation (MAD)1
Skewness-0.5052698753
Sum450044
Variance1.615023077
MonotonicityNot monotonic
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
440675
31.3%
531724
24.4%
327037
20.8%
217174
13.2%
113265
 
10.2%
05
 
< 0.1%
ValueCountFrequency (%)
05
 
< 0.1%
113265
 
10.2%
217174
13.2%
327037
20.8%
440675
31.3%
531724
24.4%
ValueCountFrequency (%)
531724
24.4%
440675
31.3%
327037
20.8%
217174
13.2%
113265
 
10.2%
05
 
< 0.1%

Leg room service
Real number (ℝ≥0)

HIGH CORRELATION

Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.485902371
Minimum0
Maximum5
Zeros444
Zeros (%)0.3%
Negative0
Negative (%)0.0%
Memory size1014.8 KiB

Quantile statistics

Minimum0
5-th percentile1
Q12
median4
Q35
95-th percentile5
Maximum5
Range5
Interquartile range (IQR)3

Descriptive statistics

Standard deviation1.292225983
Coefficient of variation (CV)0.3707005663
Kurtosis-0.8413209574
Mean3.485902371
Median Absolute Deviation (MAD)1
Skewness-0.4964400708
Sum452749
Variance1.669847991
MonotonicityNot monotonic
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
439698
30.6%
534385
26.5%
322467
17.3%
221745
16.7%
111141
 
8.6%
0444
 
0.3%
ValueCountFrequency (%)
0444
 
0.3%
111141
 
8.6%
221745
16.7%
322467
17.3%
439698
30.6%
534385
26.5%
ValueCountFrequency (%)
534385
26.5%
439698
30.6%
322467
17.3%
221745
16.7%
111141
 
8.6%
0444
 
0.3%

Baggage handling
Categorical

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct5
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size1014.8 KiB
4
48240 
5
35748 
3
24485 
2
13432 
1
7975 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters0
Distinct characters0
Distinct categories0 ?
Distinct scripts0 ?
Distinct blocks0 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3
2nd row4
3rd row4
4th row1
5th row2

Common Values

ValueCountFrequency (%)
448240
37.1%
535748
27.5%
324485
18.9%
213432
 
10.3%
17975
 
6.1%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
448240
37.1%
535748
27.5%
324485
18.9%
213432
 
10.3%
17975
 
6.1%

Checkin service
Real number (ℝ≥0)

HIGH CORRELATION

Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.340806899
Minimum0
Maximum5
Zeros1
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size1014.8 KiB

Quantile statistics

Minimum0
5-th percentile1
Q13
median3
Q34
95-th percentile5
Maximum5
Range5
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.260582285
Coefficient of variation (CV)0.3773286883
Kurtosis-0.7935110538
Mean3.340806899
Median Absolute Deviation (MAD)1
Skewness-0.3924424812
Sum433904
Variance1.589067697
MonotonicityNot monotonic
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
436481
28.1%
335538
27.4%
527005
20.8%
215486
11.9%
115369
11.8%
01
 
< 0.1%
ValueCountFrequency (%)
01
 
< 0.1%
115369
11.8%
215486
11.9%
335538
27.4%
436481
28.1%
527005
20.8%
ValueCountFrequency (%)
527005
20.8%
436481
28.1%
335538
27.4%
215486
11.9%
115369
11.8%
01
 
< 0.1%

Cleanliness
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.705759162
Minimum0
Maximum5
Zeros5
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size1014.8 KiB

Quantile statistics

Minimum0
5-th percentile1
Q13
median4
Q35
95-th percentile5
Maximum5
Range5
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.151773912
Coefficient of variation (CV)0.3108064667
Kurtosis-0.2088886554
Mean3.705759162
Median Absolute Deviation (MAD)1
Skewness-0.7560006872
Sum481304
Variance1.326583144
MonotonicityNot monotonic
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
448795
37.6%
535916
27.7%
323984
18.5%
213412
 
10.3%
17768
 
6.0%
05
 
< 0.1%
ValueCountFrequency (%)
05
 
< 0.1%
17768
 
6.0%
213412
 
10.3%
323984
18.5%
448795
37.6%
535916
27.7%
ValueCountFrequency (%)
535916
27.7%
448795
37.6%
323984
18.5%
213412
 
10.3%
17768
 
6.0%
05
 
< 0.1%

Online boarding
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.352587003
Minimum0
Maximum5
Zeros14
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size1014.8 KiB

Quantile statistics

Minimum0
5-th percentile1
Q12
median4
Q34
95-th percentile5
Maximum5
Range5
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.298714502
Coefficient of variation (CV)0.387376823
Kurtosis-0.9380499192
Mean3.352587003
Median Absolute Deviation (MAD)1
Skewness-0.3664956098
Sum435434
Variance1.686659358
MonotonicityNot monotonic
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
435181
27.1%
330780
23.7%
529973
23.1%
218573
14.3%
115359
11.8%
014
 
< 0.1%
ValueCountFrequency (%)
014
 
< 0.1%
115359
11.8%
218573
14.3%
330780
23.7%
435181
27.1%
529973
23.1%
ValueCountFrequency (%)
529973
23.1%
435181
27.1%
330780
23.7%
218573
14.3%
115359
11.8%
014
 
< 0.1%

Departure Delay in Minutes
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
ZEROS

Distinct466
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean14.71371266
Minimum0
Maximum1592
Zeros73356
Zeros (%)56.5%
Negative0
Negative (%)0.0%
Memory size1014.8 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q312
95-th percentile77
Maximum1592
Range1592
Interquartile range (IQR)12

Descriptive statistics

Standard deviation38.07112622
Coefficient of variation (CV)2.587458862
Kurtosis100.6445463
Mean14.71371266
Median Absolute Deviation (MAD)0
Skewness6.82198031
Sum1911017
Variance1449.410651
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
Value  Count  Frequency (%)
0  73356  56.5%
1  3682  2.8%
2  2855  2.2%
3  2535  2.0%
4  2309  1.8%
5  2136  1.6%
6  1884  1.5%
7  1748  1.3%
8  1618  1.2%
9  1552  1.2%
Other values (456)  36205  27.9%

Value  Count  Frequency (%)
0  73356  56.5%
1  3682  2.8%
2  2855  2.2%
3  2535  2.0%
4  2309  1.8%
5  2136  1.6%
6  1884  1.5%
7  1748  1.3%
8  1618  1.2%
9  1552  1.2%

Value  Count  Frequency (%)
1592  1  < 0.1%
1305  1  < 0.1%
1128  1  < 0.1%
1017  1  < 0.1%
978  1  < 0.1%
951  1  < 0.1%
933  1  < 0.1%
930  1  < 0.1%
921  1  < 0.1%
859  1  < 0.1%

Arrival Delay in Minutes
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
ZEROS

Distinct472
Distinct (%)0.4%
Missing393
Missing (%)0.3%
Infinite0
Infinite (%)0.0%
Mean15.09112884
Minimum0
Maximum1584
Zeros72753
Zeros (%)56.0%
Negative0
Negative (%)0.0%
Memory size1014.8 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q313
95-th percentile78
Maximum1584
Range1584
Interquartile range (IQR)13

Descriptive statistics

Standard deviation38.46565024
Coefficient of variation (CV)2.548891514
Kurtosis95.11711419
Mean15.09112884
Median Absolute Deviation (MAD)0
Skewness6.670124611
Sum1954105
Variance1479.606248
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
Value  Count  Frequency (%)
0  72753  56.0%
1  2747  2.1%
2  2587  2.0%
3  2442  1.9%
4  2373  1.8%
5  2083  1.6%
6  2021  1.6%
7  1794  1.4%
8  1751  1.3%
9  1566  1.2%
Other values (462)  37370  28.8%

Value  Count  Frequency (%)
0  72753  56.0%
1  2747  2.1%
2  2587  2.0%
3  2442  1.9%
4  2373  1.8%
5  2083  1.6%
6  2021  1.6%
7  1794  1.4%
8  1751  1.3%
9  1566  1.2%

Value  Count  Frequency (%)
1584  1  < 0.1%
1280  1  < 0.1%
1115  1  < 0.1%
1011  1  < 0.1%
970  1  < 0.1%
952  1  < 0.1%
940  1  < 0.1%
924  1  < 0.1%
920  1  < 0.1%
860  1  < 0.1%

Interactions

Correlations

Spearman's ρ

Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and +1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
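
The calculation above can be sketched in plain Python: rank both variables, then apply the covariance-over-standard-deviations formula to the ranks. This is an illustrative sketch, not the code the report itself runs; the helper names `average_ranks`, `pearson` and `spearman` are hypothetical, and tied values are assumed to share the mean of their rank positions.

```python
from statistics import mean


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def pearson(xs, ys):
    """Covariance of xs and ys divided by the product of their standard deviations."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sum((x - mx) ** 2 for x in xs) ** 0.5
    sy = sum((y - my) ** 2 for y in ys) ** 0.5
    return cov / (sx * sy)


def spearman(xs, ys):
    """Spearman's rho: Pearson's r applied to the rank variables."""
    return pearson(average_ranks(xs), average_ranks(ys))


# Made-up data: y = x**3 is monotonic in x but not linear.
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]
print(spearman(x, y))  # very close to 1: the relationship is perfectly monotonic
print(pearson(x, y))   # below 1: the relationship is not linear
```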

Pearson's r

Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
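
A short sketch of this definition, including a check of the location and scale invariance mentioned above. The function name `pearson` and the toy data are illustrative assumptions, not part of the report.

```python
def pearson(x, y):
    """Pearson's r: covariance of x and y over the product of their standard deviations."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y)) / n
    sx = (sum((a - mx) ** 2 for a in x) / n) ** 0.5
    sy = (sum((b - my) ** 2 for b in y) / n) ** 0.5
    return cov / (sx * sy)


x = [0, 3, 5, 9, 12]
y = [1, 2, 6, 7, 11]

r = pearson(x, y)
# Shift and rescale each variable separately, using positive scale factors.
r_rescaled = pearson([2 * a + 100 for a in x], [0.5 * b - 3 for b in y])
# A negative scale factor flips the sign of r but keeps its magnitude.
r_flipped = pearson(x, [-b for b in y])

print(r, r_rescaled, r_flipped)
```

The first two values agree up to rounding, and the third is their negation.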

Kendall's τ

Like Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation.

To calculate τ for two variables X and Y, one counts the concordant and discordant pairs of observations. τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
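
The pair-counting definition can be written out directly. The sketch below implements exactly the formula stated in the text (often called tau-a); tie-corrected variants used by statistical libraries can give different values when the data contain ties. The function name `kendall_tau_a` and the data are illustrative.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant - discordant) / total pairs, as described in the text."""
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        sign = (x1 - x2) * (y1 - y2)
        if sign > 0:
            concordant += 1   # both variables order the pair the same way
        elif sign < 0:
            discordant += 1   # the variables order the pair oppositely
        # sign == 0 means a tie in x or y; it counts toward neither total
    n = len(x)
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs


x = [1, 2, 3, 4, 5]
y = [3, 1, 2, 5, 4]
tau = kendall_tau_a(x, y)
print(tau)
```

Here 7 of the 10 pairs are concordant and 3 are discordant, so τ = (7 - 3) / 10.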

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
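
As a rough illustration of the bias correction, the sketch below computes a corrected Cramér's V from a contingency table of counts. It assumes the correction as it is commonly stated for Bergsma's estimator: φ² = χ²/n is reduced by (k-1)(r-1)/(n-1) and floored at 0, and the row and column counts r and k are shrunk by (r-1)²/(n-1) and (k-1)²/(n-1). The function name `cramers_v_corrected` is hypothetical, and the code is not taken from the report generator.

```python
import math


def cramers_v_corrected(table):
    """Bias-corrected Cramér's V for a contingency table (list of rows of counts).

    Assumes every row and every column has a nonzero total.
    """
    r, k = len(table), len(table[0])
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(table[i][j] for i in range(r)) for j in range(k)]

    # Pearson chi-squared statistic against the independence expectation.
    chi2 = 0.0
    for i in range(r):
        for j in range(k):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (table[i][j] - expected) ** 2 / expected

    phi2 = chi2 / n
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    return math.sqrt(phi2_corr / min(k_corr - 1, r_corr - 1))


independent = [[25, 25], [25, 25]]  # both variables unrelated
perfect = [[50, 0], [0, 50]]        # one variable determines the other
v_independent = cramers_v_corrected(independent)
v_perfect = cramers_v_corrected(perfect)
print(v_independent, v_perfect)
```

The two toy tables sit at the ends of the scale: the independent table gives 0 and the perfectly associated one gives 1 up to rounding.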

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out visual patterns in data completion.
The dendrogram allows you to more fully correlate variable completion, revealing trends deeper than the pairwise ones visible in the correlation heatmap.

Sample

First rows

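Index | satisfaction | Gender | Customer Type | Age | Type of Travel | Class | Flight Distance | Seat comfort | Departure/Arrival time convenient | Food and drink | Gate location | Inflight wifi service | Inflight entertainment | Online support | Ease of Online booking | On-board service | Leg room service | Baggage handling | Checkin service | Cleanliness | Online boarding | Departure Delay in Minutes | Arrival Delay in Minutes
0 | satisfied | Female | Loyal Customer | 65 | Personal Travel | Eco | 265 | 0 | 0 | 0 | 2 | 2 | 4 | 2 | 3 | 3 | 0 | 3 | 5 | 3 | 2 | 0 | 0.0
1 | satisfied | Male | Loyal Customer | 47 | Personal Travel | Business | 2464 | 0 | 0 | 0 | 3 | 0 | 2 | 2 | 3 | 4 | 4 | 4 | 2 | 3 | 2 | 310 | 305.0
2 | satisfied | Female | Loyal Customer | 15 | Personal Travel | Eco | 2138 | 0 | 0 | 0 | 3 | 2 | 0 | 2 | 2 | 3 | 3 | 4 | 4 | 4 | 2 | 0 | 0.0
3 | satisfied | Female | Loyal Customer | 60 | Personal Travel | Eco | 623 | 0 | 0 | 0 | 3 | 3 | 4 | 3 | 1 | 1 | 0 | 1 | 4 | 1 | 3 | 0 | 0.0
4 | satisfied | Female | Loyal Customer | 70 | Personal Travel | Eco | 354 | 0 | 0 | 0 | 3 | 4 | 3 | 4 | 2 | 2 | 0 | 2 | 4 | 2 | 5 | 0 | 0.0
5 | satisfied | Male | Loyal Customer | 30 | Personal Travel | Eco | 1894 | 0 | 0 | 0 | 3 | 2 | 0 | 2 | 2 | 5 | 4 | 5 | 5 | 4 | 2 | 0 | 0.0
6 | satisfied | Female | Loyal Customer | 66 | Personal Travel | Eco | 227 | 0 | 0 | 0 | 3 | 2 | 5 | 5 | 5 | 5 | 0 | 5 | 5 | 5 | 3 | 17 | 15.0
7 | satisfied | Male | Loyal Customer | 10 | Personal Travel | Eco | 1812 | 0 | 0 | 0 | 3 | 2 | 0 | 2 | 2 | 3 | 3 | 4 | 5 | 4 | 2 | 0 | 0.0
8 | satisfied | Female | Loyal Customer | 56 | Personal Travel | Business | 73 | 0 | 0 | 0 | 3 | 5 | 3 | 5 | 4 | 4 | 0 | 1 | 5 | 4 | 4 | 0 | 0.0
9 | satisfied | Male | Loyal Customer | 22 | Personal Travel | Eco | 1556 | 0 | 0 | 0 | 3 | 2 | 0 | 2 | 2 | 2 | 4 | 5 | 3 | 4 | 2 | 30 | 26.0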

Last rows

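Index | satisfaction | Gender | Customer Type | Age | Type of Travel | Class | Flight Distance | Seat comfort | Departure/Arrival time convenient | Food and drink | Gate location | Inflight wifi service | Inflight entertainment | Online support | Ease of Online booking | On-board service | Leg room service | Baggage handling | Checkin service | Cleanliness | Online boarding | Departure Delay in Minutes | Arrival Delay in Minutes
129870 | satisfied | Female | disloyal Customer | 70 | Personal Travel | Eco | 1674 | 5 | 4 | 5 | 1 | 5 | 5 | 5 | 5 | 3 | 2 | 4 | 5 | 4 | 5 | 54 | 46.0
129871 | satisfied | Female | disloyal Customer | 35 | Personal Travel | Eco | 3287 | 5 | 4 | 5 | 3 | 2 | 5 | 2 | 2 | 4 | 5 | 4 | 4 | 3 | 2 | 9 | 0.0
129872 | satisfied | Female | disloyal Customer | 69 | Personal Travel | Eco | 2240 | 5 | 4 | 5 | 3 | 4 | 5 | 4 | 4 | 5 | 4 | 4 | 3 | 4 | 4 | 4 | 0.0
129873 | satisfied | Female | disloyal Customer | 63 | Personal Travel | Eco | 1942 | 5 | 5 | 4 | 4 | 3 | 4 | 3 | 3 | 5 | 2 | 5 | 3 | 5 | 3 | 7 | NaN
129874 | satisfied | Female | disloyal Customer | 11 | Personal Travel | Eco | 2752 | 5 | 5 | 5 | 2 | 2 | 5 | 2 | 2 | 3 | 5 | 3 | 5 | 4 | 2 | 5 | 0.0
129875 | satisfied | Female | disloyal Customer | 29 | Personal Travel | Eco | 1731 | 5 | 5 | 5 | 3 | 2 | 5 | 2 | 2 | 3 | 3 | 4 | 4 | 4 | 2 | 0 | 0.0
129876 | dissatisfied | Male | disloyal Customer | 63 | Personal Travel | Business | 2087 | 2 | 3 | 2 | 4 | 2 | 1 | 1 | 3 | 2 | 3 | 3 | 1 | 2 | 1 | 174 | 172.0
129877 | dissatisfied | Male | disloyal Customer | 69 | Personal Travel | Eco | 2320 | 3 | 0 | 3 | 3 | 3 | 2 | 2 | 4 | 4 | 3 | 4 | 2 | 3 | 2 | 155 | 163.0
129878 | dissatisfied | Male | disloyal Customer | 66 | Personal Travel | Eco | 2450 | 3 | 2 | 3 | 2 | 3 | 2 | 2 | 3 | 3 | 2 | 3 | 2 | 1 | 2 | 193 | 205.0
129879 | dissatisfied | Female | disloyal Customer | 38 | Personal Travel | Eco | 4307 | 3 | 4 | 3 | 3 | 3 | 3 | 3 | 4 | 5 | 5 | 5 | 3 | 3 | 3 | 185 | 186.0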